04. Comparison

Lesson 4 04 Comparison

Definition

When your problem statement requires the comparison of two data features or cohorts

_ Example_

Problem statement reads: What are the demographic differences between the Top 10% of FIFA players by market value and the remaining 90% of players?

Which visualization will work best?

  1. Box plots will provide a comprehensive picture of how the two cohorts are comparing:
  • Center will tell if on average the cohorts are similar
  • Spread will tell you if they vary differently
  • Shape (symmetry, skewness) will indicate any asymmetry
  • Unusual features (outliers, missingness)

If you need a refresher on box plots, feel free to explore the refresher course on descriptive statistics available in our pre-requisite courses within the classroom.